Using topological data analysis for building Bayesan neural networks
Annotation
For the first time, a simplified approach to constructing Bayesian neural networks is proposed, combining computational efficiency with the ability to analyze the learning process. The proposed approach is based on Bayesianization of a deterministic neural network by randomizing parameters only at the interface level, i.e., the formation of a Bayesian neural network based on a given network by replacing its parameters with probability distributions that have the parameters of the original model as the average value. Evaluations of the efficiency metrics of the neural network were obtained within the framework of the approach under consideration, and the Bayesian neural network constructed through variation inference were performed using topological data analysis methods. The Bayesianization procedure is implemented through graded variation of the randomization intensity. As an alternative, two neural networks with identical structure were used — deterministic and classical Bayesian networks. The input of the neural network was supplied with the original data of two datasets in versions without noise and with added Gaussian noise. The zero and first persistent homologies for the embeddings of the formed neural networks on each layer were calculated. To assess the quality of classification, the accuracy metric was used. It is shown that the barcodes for embeddings on each layer of the Bayesianized neural network in all four scenarios are between the corresponding barcodes of the deterministic and Bayesian neural networks for both zero and first persistent homologies. In this case, the deterministic neural network is the lower bound, and the Bayesian neural network is the upper bound. It is shown that the structure of data associations within a Bayesianized neural network is inherited from a deterministic model, but acquires the properties of a Bayesian one. It has been experimentally established that there is a relationship between the normalized persistent entropy calculated on neural network embeddings and the accuracy of the neural network. For predicting accuracy, the topology of embeddings on the middle layer of the neural network model turned out to be the most revealing. The proposed approach can be used to simplify the construction of a Bayesian neural network from an already trained deterministic neural network, which opens up the possibility of increasing the accuracy of an existing neural network without ensemble with additional classifiers. It becomes possible to proactively evaluate the effectiveness of the generated neural network on simplified data without running it on a real dataset, which reduces the resource intensity of its development.
Keywords
Постоянный URL
Articles in current issue
- Modeling the illumination of the Earth’s surface to select the operating modes of the radiation source
- Luminescent dynamics of oxygen oxidation of Viburnum opulus L. in chitosan solutions with gold nanoparticles
- Dynamic surface control for omnidirectional mobile robot with full state constrains and input saturation
- Dual-wavelength digital holographic interferometry for technical applications
- Structural analysis of ZrO2 and TiO2 nanoparticles
- Investigation of polyvinyl butyral coatings with carbon quantum dots on the characteristics of silicon solar cells
- Numerical algorithm for finding the optimal composition of the reacting mixture on the basis of the reaction kinetic model
- Raman spectroscopy of nanocomposites ZnO/ZnS and ZnO/ZnSe obtained by solvothermal-microwave synthesis method
- Emotion analysis of social network data using cluster based probabilistic neural network with data parallelism
- Assessing the possibility of using the method of image decomposition based on topological features to reduce entropy during image compression
- Implementation of neural networks in the method of multilevel component circuits
- Fuzzy logic controller algorithm for placing files in a data storage system
- Personalization of convolutional neural networks within the stress detection task using heart rate variability data
- Method of modeling viscoelastic properties of oriented polymer materials using multi-barrier theory
- Design of microstrip patch antenna using Fennec Fox optimization with SSRR metamaterial for terahertz applications
- Algorithm for promptly maintaining the temperature regime of power amplification units of the radar transmitting complex based on a thermal model
- Convective heat transfer and hydrodynamics of flow at the endwall around a turbine blade under the influence of a magnetic field
- Methods of contactless registration of information signals for the audit of information security of power supply systems and networks
- Parameter estimation of permanent magnet synchronous motor
- Internal memory data protecting problems of the Renesas microcontrollers